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ABSTRACT 

Attempts to constrain the Hubble constant using the strong gravitational lens system Q0957+561 
are limited by systematic uncertainties in the mass model, since the time delay is known very pre- 
cisely. One important systematic effect is the mass sheet degeneracy, which arises because strong 
lens modeling cannot constrain the presence or absence of a uniform mass sheet k, which rescales Hq 
by the factor (1 — k). In this paper we present new constraints on the mass sheet derived from a 
weak-lensing analysis of the Hubble Space Telescope imaging of a 6 arcmin square region surrounding 
the lensed quasar. The average mass sheet within a circular aperture (the strong lens model region) is 
constrained by integrating the tangential weak gravitational shear over the surrounding area. We find 
the average convergence within a 30" radius around the lens galaxy to be k(< 30") = 0.166 ± 0.056 
(Icr confidence level), normalized to the quasar redshift. This includes contributions from both the 
lens galaxy and the surrounding cluster. We also constrain a few other low-order terms in the lens 
potential by applying a multipole aperture mass formalism to the gravitational shear in an annulus 
around the strong lensing region. Implications for strong lens models and the Hubble constant are 
discussed in an accompanying paper. 

Subject headings: 



1. INTRODUCTION 

Gravitational lensing allows for a measurement of the 
Hubble constant Hq that is independent of the "distance 
ladder" and is not sus ceptible to th e peculiar velocities 
of the local universe (lR,efsda]lfT96l . This unique op- 
portunity is available for special lenses strong enough to 
generate multiple images from a single source, and when 
this source has intrinsic variability, such that a differ- 
ential time delay can be observed between the multiple 
images. 

The doubly-imaged quasar Q0957-1-561 is the first 
confirmed example of strong gravitational lensing [SL] 
(jWalsh et al\ Il979l ). The quasar is variable; with the 
time delay accu rately deterniined to < 1% between 
the two images (iKund ic et al\ IT997I: iCollev efoll 120031: 
IShalvapin et a/.|[2008f) . the lack of precision in obtain- 
ing Ha from this system in past measurements (±35%, 
2a) lies in determining the mass distribution o f the lens 
(|Bernstein fc Fischeii[l999l : iKeeton et aZ.ll2000f) . 

One well-known problem in uniquely determining the 
mass distrib ution (and hence JJq) is the "mass-shee t 
degeneracy" (|Falco o/.lll985t iGOTenstein et a/]|1988[ ). 
The lens mass is modeled given the multiple images as 
constraints; the lensed light is traced back from the mul- 



tiple image according to a given lens model, and checked 
for a consistent, single source. Unfortunately, the true 
angular position of the source is not observable and hence 
not constrained; the mass-sheet degeneracy occurs be- 
cause the angular position of the source object is degener- 
ate wit h the constant surface mas s density, or the "mass 
sheet" (jBernstein fc Fischeijll999t ). Thus, the mass sheet 
manifests itself as a uniform magnification of the (unob- 
servable) source plane. 

This degeneracy in the SL model can be lifted if the av- 
erage mass overdensity in the SL model region is known. 
This is done by using the aper ture-mass weak g ravita- 
tional lensing [WL] technique (jKaiser fc Sauire£l ll993'). 
WL generates a subtle but coherent distortion of the 
background galaxies, which can be used to infer the 
mass distribution. In particular, the integrated tangen- 
tial shear around a circular aperture non-parametrically 
determines the two-dimensional average mass overden- 
sity within the aperture. 

The WL measurement suffers from its own mass-sheet 
degeneracy, because the WL signal must formally be in- 
tegrated over the surrounding region out to infinity to 
obtain the mass distribution. While this is not practical, 
a realistic scheme to estimate the average mass sheet 
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in a SL region is to integrate the WL distortions over 
an annular region surrounding the SL model. WL then 
measures the differential mass sheet AR of the disk re- 
gion defined by the inner and outer circle of the annulus, 
where the average convergence within the large circle is 
small, and can be estimated from an assumed mass pro- 
file. The mass sheet fo r Q0957-I-561 has p reviously been 
measured in this way (iFischer et aZ.lll997t ). 

There are other sources of degeneracy in obtaining Hq 
from SL. The Q0957-I-561 lens is a galaxy which sits 
within a modest cluster, so the lens mass model must 
include both sharply and smoothly varying terms, corre- 
sponding to that of the galaxy and cluster, respectively. 
The cluster potential can be expanded in Taylor series; 
the second-order terms (the lowest order contributing 
to gravitational lensing) corresponds to the mass sheet 
and constant shear within the SL region. Past studies 
of the lens model have shown that the uncertainty in 
Hq measured from Q0957-I-561 is dominated by the de- 
generacy betw een the galaxy ellipticity and cluster shear 
(|Keeton et a/.l[2000. and references therein). For many 
models, the parametric fits tend to converge to a large 
cluster shear (7c ~0. 1-0.3), while the quasar host galaxy 
lensed images seem to imply a s mall shear (7e < 0.1) 
(|Keeton et al\ \200(Pi. Additionallv. iKochaned i|1991h and 
subsequent papers have shown that the third-order terms 
cannot be neglected for the Q0957-I-561 cluster. 

The goal of this paper is to constrain the smoothly 
varying components of the lens, i.e., the mass-sheet and a 
few of the lowest-order cluster terms, using WL analysis. 
The convergence and shear pattern multipoles relative to 
an aperture are related to each o t her in a simple manner 
([Schneider fc Bartelmaniil 119971 : iBernstein fc Nakaiimal 
|2008() , and this relation has been applied to the newly ac- 
quired images from Hubble Space Telescope (HST) Ad- 
vanced Camera for Surveys (ACS). The combined lens 
model from strong lensing analys is of the system is dis- 
cussed in an accompanying paper (|Fadelv efaLll2009D . as 
well as its implication for Hq. 



2. THEORETICAL BACKGROUND 
2.1. Time Delay and Hubble Constant 

When the gravitational field of a lens object is strong 
enough, the light path can be sufficiently perturbed such 
that it generates multiple images of a single light source. 
The bending of two or more separate light paths arriving 
at the observer is determined from the lens configuration; 
i.e., the distances between observer, lens and source, as 
well as the mass distribution of the lens. The SL observa- 
tions consist of (dimensionless) angular separations, red- 
shifts, and relative magnitudes of the multiple images, 
and have no distance or mass scales directly associated 
to them. A measurement of the time delays between the 
multiple images provi de an absolute time/distance scale 
to the lensing system (' Refsdallll964|) . 

Two light paths originating from the same object un- 
dergo different time delays due to differences in (a) their 
overall path lengths, and (b) the depth of gravitational 
potential as they cross the lens. The time delay (with 
respect to a straight path from /3, the true, unobserved 
angular position of the source, with no intervening lens) 
for a single light path traveling through 9 in the lens 



plane is 
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where the delay due to the path difference is proportional 
to (0-/3)^/2, and the other term represents the Shapiro 
time delay, where ip is the dimensionless, projected and 
scaled gravitational potential of the lens: 
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Here, $ is the ordinary 3D Newtonian potential, £ is the 
distance along the line-of-sight, and D indicates angu- 
lar diameter distances between observer (O), lens (L) 
and source {S). The absolute delay t{6) is not observ- 
able, but the difference At = t{6i) — t{62) is, where 
cAt is proportional to H^ from the definition of angu- 
lar diameter distances. Hence, we see that the Hubble 
constant Hq can be obtained from the geometry of the 
lens system if At is measured and the lensing potential 
i^{0) is known. In general, V' is obtained fro m fits to pa- 
rameterized lens m odels (e.g.. IBernstein fc Fischeilll999l : 
iKeeton et al\ |2000() . where the best fit to the strongly 
lensed images is chosen. 

2.2. Mass Sheet Degeneracy 

A fit to a parameterized lens model il) does not provide 
us with a unique solution, however. A uniform mass 
sheet cannot be constrained by (and hence not included 
in) the SL models. Specifically, for a lens potential V'(^) 
and source position /3 that satisfies the SL constraints. 



(3) 

/3' = (l-ACo)/3 (4) 

also satisfies the constraints, where kq is the constant 
convergence (see Eq.®) from the additional uniform 
mass sheet. Under this degenerate transformation, the 
time delay is 

(At)' = (1 - Ko)At, (5) 
requiring the Hubble parameter to rescale by Hq = (1 — 

Kq)Hq. 

The lensing convergence k{0) is a normalized surface 
density, and is defined as 



V2^(6») = 2k(0) = 2 
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where S(0) is the surface mass density, and the critical 
surface density is determined from the lens and source 
redshifts, and the assumed cosmology: 

_ Dos , . 
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A SL region which can generate multiple images has 
E <L Scrit, OT K ^ 1, while WL techniques are generally 
valid only in a region where k <C 1. 

To remove the (1 — kq) degeneracy in Hq, one must 
obtain the uniform mass sheet kq not included in the 
SL model. The overall average mass sheet (k) can be 
constrained via WL, which constrains the sum of iish 
and Kq, where Rsh is the average convergence of the SL 
model lens mass distribution. The WL information is 
obtained over a field wider than the area where ■(/'(^) is 
modeled, and its procedure is described in ^4.61 
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2.3. Cluster Model 

The Q0957+561 lens system consists of the primary 
lens galaxy Gl at redshift zicns = 0.355, the relatively 
weak cluster which contains Gl, and the quasar itself 
which lies at Zgrc = 1.41. The lens model in the SL anal- 
ysis consists of a mass concentration with near elliptical 
symmetry representing the galaxy, with additional clus- 
ter potential components that are Taylor expanded in 
position to the thir d order (|Bernstein fc Nakaiimal[2008l : 
iFadelv et a/.l 120091) . The expansion is defined within 
r < RsL centered at the central peak of Gl, and is fully 
general to order {t/Rsl)^, where Rsl is the radius of the 
circular boundary where the SL modeling takes place: 



ipir) = (1 - Kc)x 
V'g(r) -HRe 



7c 2 -2i. 

— r e 
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where r = {x,y) = (r, is the transverse distance in 
the lens plane (and is equivalent to 6 in the previous sec- 
tions), ipg{r) is the galaxy potential, and Kc, 7c, fc and Sc 
are the (complex) constants corresponding to the cluster 
mass sheet, constant shear, internal mass dipole moment, 
and m — 3 shear moment, respectively. The complex no- 
tation corresponds to the two-dimensional description of 
the gravitational potential such that, e.g., the derivative 
is 

d . d f d i d 



d 



dx dy 
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The potential and the mass sheet always real. The 

first four terms in the square bracket are those modeled 
in SL, and are multiplied by (1 — Kc) due to the mass 
sheet degeneracy associated with the analysis; Kc is the 
mass sheet that is not part of the SL model, and cannot 
be constrained from SL model within r < Rsl alone, 
due to degeneracy (i i2.2|) . The four quantities Kc, 7c, 
(Tc and Sc are constrained by WL measurements from 
data in Rsl < r < i?max and the multipole formalism 
(i j4.6p . The inner and outer annular radii were chosen 
to he Rsl ~ 30" and -Rmax = 186" for our analysis, 
such that we consider a region sufficiently removed from 
the SL region, but with reasonable WL signal from the 
cluster. 

3. OBSERVATIONS AND DATA 
3.1. EST/ ACS Imaging 

The images for the cluster around the double quasar 
Q0957-h561 were taken as a 2 x 2 mosaic of HST ACS 
pointings of the Wide Field Channel (WPG) detector on 
October 10-13, 2005 (Fig. [J). Each ACS WPG pointing 
has 202" X 202" field of view (FOV), resulting in a 6' x 6' 
mosaic image. Each pointing has 4 orbits (4 x 1920s) in 
the F606W (Broad V) filter and 2 orbits (2 x 1880s) in 
the F814W (Broad I) filter; the mosaic has the central 
30" region overlapped such that the region around the 
quasar images have four times the depth (30ks and 15ks 
for F606 W and F814W, re spectively) for studies of SL 
features (|Fadelv et a/.ll2009f ). 

The HST WFG is a mosaic of two 4096 x 2048 
pixel charge-coupled devices (CCDs) of approximately 




Fig. 1.— The 6' x 6' combined image of the 0957+561 field in 
ACS F606W. Each quadrant is a single pointing (with pixel scale 
dithering) which has been imaged to a depth of 7.5ks, while all 
four quadrants overlap in the central 30" region to yield a depth 
of 30ks. The chip gap is apparent in each of the quadrants. The 
orientation and the image scale are as labeled. The apparent shear 
in the FOV is from correcting for the image distortion due to the 
ACS focal plane located off-center from the telescope axis. 

0.05"/pixel. Each orbit is split into four dithered expo- 
sures to sample the diffraction-limited point-spread func- 
tion (PSF) and to remove defective pixels and cosmic 
rays in the combined image. 

3.2. Object Catalog 

We created a.n initial stacked mosaic image using 
MultiDrizzle (jKoekemoer et al\ l2002f) . where we re- 
fine relative shifts between individual exposures by cross- 
correlating the positions of compact sources in overlap- 
ping regions. O bjects were detected in this image us- 
ing SExtractor (|Bertin fc Arnoutsl ll996Y We took the 
initial SExtractor detection parameters and refined the 
centroid, size and shape of each object using a native 
GLFit iNakaiima fc BernsteinI 120071) . The refined 

parameters determined here are still those which are con- 
volved with the PSF, therefore the size-magnitude dia- 
gram allows us to determine the stellar locus (Figure [5]). 
The stellar objects have an average full-width half max- 
imum (FWHM) of 0.13" in F606W, or a corresponding 
a = 0.054", where a is the characteristic width of the 
best-fit gaussian. 

Once the stars and galaxies were separated, the true 
(pre-PSF) galaxy shapes were obtained by performing 
a deconvolution GLFit, based on an interpolated PSF 
(i i|4.2[) . Both the native and deconvolution GLFit are 
done as a "multifit" — an individual object is fitted simul- 
taneously over pixel data from each exposure rather than 
from the combined image — hence the measured shapes 
are not affected by distortion or aliasing induced by the 
image-combining process (f: 



4. WEAK LENSING ANALYSIS 

4.1. EGL Method 

For our WL analysis, we employ the Elliptical Gauss 
Laguerre (E GL) method of shape measurement a nd shear 
estimation (|BJ02l : INakaiima fc BernsteinI |2007[ ) . imple- 
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Fig. 2. — Size-magnitude diagram of all detected objects from 
native GLFit. The boxed region shows a portion of the stellar locus 
with no saturation and no galaxy contamination. The stellar size 
a is approximately the size of the ACS WFC pixel, 0.054", where 
cr is the characteristic width scale for the best-fit gaussian (for 
objects with elongated shapes, a is the geometric mean of the two 
characteristic widths). The crosses are all detected objects, while 
the circles indicate objects which satisfy the size, magnitude, flags, 
and signal to noise (S/N)> 25 cut for use in WL analysis. The 
size and magnitude cuts are indicated by a dotted horizontal and 
vertical line, respectively. 

mented as GLFit; we provide here a brief description. In 
characterizing their shapes, star and galaxy images are 
decomposed into 2d Gauss-Laguerre (GL) basis functions 
(p3J02). In the EGL method, the GL basis functions 
are described upon a "basis elhpse," i.e., a sheared and 
stretched coordinate system such that a unit circle ap- 
pears as an ellipse of chosen size, shape or orientation; 
and so that the lowest order GL function is an elliptical 
2d Gaussian. The combination of the basis ellipse and 
coefficients to the GL functions then fully describes the 
image. 

The choice of basis ellipse varies depending on its pur- 
pose. In describing the PSF, we choose a common ba- 
sis ellipse: a circle with radius of the average PSF size. 
The PSF variation across the FOV can then be traced 
through the variation in the GL coefficients, and allows 
for a simple description of the necessary interpolation. 

In contrast, when measuring galaxy shapes, the basis 
ellipse is chosen such that (in the simplified case of no 
PSF smearing) the lowest order GL function is the best- 
matched 2d gaussian ellipse to the galaxy image. This 
basis ellipse then describes the shape (ellipticity) of the 
galaxy. In the observed image, the true galaxy shape 
has been convolved with the PSF. So our fitting does a 
simple deconvolution by modeling the true galaxy shape 
with a GL expansion, and adjusting the coefficients so 
that conv olution of this model with the PSF best matches 
the data ([Nakajima k Bernsteiiill2007f l. 

4.2. PSF Interpolation 

In order to obtain the true galaxy shapes, the PSF 
must be known at the position of the galaxy image. This 
is done by selecting the stars in the image and interpo- 
lating the shape at the galaxy position. There are two 
problems that make this difficult for our images. First, 
the PSF spatial variation is not stable over time: HST 
is known to "breathe" going in and out of the earth's 
shadow, causing its focus to vary on time scales of a sin- 
gle orbit (~ 1.5 hour). Hence it is unlikely that a single 
PSF spatial variation model would be valid for all ex- 



posures ()Schrabback et aLll2007l and references therein). 
Second, there are not enough stars (~20/chip) in our 
cluster image to create a reliable model of the PSF vari- 
ation across every exposure. 

To circumvent the aforementioned difficulties, we use 
publicly available ACS WFC stellar field images to cre- 
ate the models of the PSF variation for every exposure. 
A given PSF spatial pattern is highly reproducible, since 
the spatial variation of PSF patterns in different expo- 
sures is caused by the thermal breathing of the HST fo- 
cus. If the few PSF available on a given exposure can 
constrain a portion of the pattern, the rest of the PSF 
patte rn can be predicted (|Rhodes et al\ l2005t iJee et all 
[2001 . 

The noise in the interpolation is minimized by utilizing 
principal component analysis (PC A). Our procedure is as 
follows: 

1. Obtain the stellar field (SF) images available from 
the HST archive. There were 184 archival F606W 
exposures of dense stell ar fields; the criter i a for se- 
lection are described in ISchrabback et all ([2007f ). 

2. The PSF anisotropy is measured from each star in 
term s of the anisotropy kernel q (ISchrabback et all 
12007*). and its variation across the ficld-of-view is 
characterized by a third-order polynomial fit. 

3. We then choose, for each of our Q0957 exposure, 
the SF image that best matches the anisotropy pat- 
tern of the stars. The "best-fit" stellar field is iden- 
tified by that which yields the minimal 

^SF = ~ Imodcli^^ :yi)? 

i=l 

for a given exposure, where qi is the kernel for 
the ith star in our exposure (which have very few 
stars), and q^^^^i{xi,yi) is the interpolated kernel 
at the position of the ith star for the matching stel- 
lar field. 

4. For the PSF interpolation, obtain the PSF GL co- 
efficients by (a) dividing each stellar field exposure 
into a 8 X 8 grid, and then (b) fitting all stars within 
the grid to a constant PSF model for that cell, re- 
sulting in 64 PSFs per exposure (32 PSFs per chip). 
We choose to average the PSF in this way because 
the ACS WFC undersamples the PSF; by using 
multiple PSF images within the grid, the PSF is 
effectively "dithered" to obtain better sampling for 
the fit. We then have, for a given stellar field / 
and within every grid cell g, a GL coefficient vector 
b^f = {b(^} describing the PSF, where i runs over 
the GL indices. All PSF coefficients from {f,g} 
are described over a common basis ellipse, which 
we take to be a circle with radius equal to the av- 
erage size of the PSF. 

5. Extract the principal components b" = {bf} 
of the collection o f PSF GL coeflicients {h^s} 
([Jarvis fc Jainll2004[ ) . Express the PSF in terms of 
the PGA vectors, h^a = Ea/^^^b", where a runs 
over the PGA vectors. The PSF is now described 
in terms of PCA coefficients /3 — {Pa} instead of 
the GL coefficients b = {bi}- 
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6. Truncate the PCA coefficients by using only tlie 
major principal components which corresponds to 
the PSF (spatial and temporal) variation, but not 
to the noise. These components are identified by 
ranking the variance of each, and identifying a gap 
in the variance (see Fig. [3]). Most of the variation 
in PSF is described in the first several terms in a. 

7. Obtain a third-order, two-dimensional polynomial 
fit across the image for the relevant PCA compo- 
nents a for each field /, such that the interpolated 
PCA coefficients are described as 

0<(n+m)<3 

where = Y^giPL^ - Paf is minimized, c;^ 
is the polynomial coefficient for a in field /, and 
{xg,yg) are the grid center coordinates. Each PCA 
description of the PSF on every grid [13^^) was vi- 
sually inspected; obvious failures in the PSF de- 
scriptions (due to lack of sufficient stars within the 
grid) were eliminated before being used in this fit. 

8. The PSF at any location [x, y) over any field / can 
then be modeled using the known principal com- 
ponents and its 2-dimensional polynomial fit coef- 
ficients across the given exposure. 

Figure [3] shows the variance of each of the PSF prin- 
cipal components, for each WFC ACS CCD. The zeroth 
component is the average PSF shape over all grid cells 
and exposures. We find that the first seven components 
constitute the primary variance and are sufficient to de- 
scribe the PSF variation; from visual inspection, the 
residual is consistent with ~ 1% noise with respect to 
the peak. These seven components are used to model 
the PSF across the exposure. 

4.3. Galaxy Multifit 

Once the PSF is known at a given galaxy location, 
we perform the deconvolution GLFit to obtain the true 
galaxy shape. We utilize the multiple exposures taken 
of each galaxy without combining them. The "multifit" 
technique performs simultaneous fitting over individual 
exposures, each with distinct PSFs, assuming a single 
"true" galaxy model, and the fitting done with the con- 
volved GL basis functions of each exposure to their re- 
spective PSFs. The simultaneous fitting is iterated over 
the basis ellipses to obtain the best-fitting true galaxy 
shape. 

The pixel image is distorted with respect to the true 
image, so distortion correction is necessary within the 
multifit procedure. The galaxy model, described in sky 
coordinates, is fitted to the pixel fiux information via 
a pixel-to-sky coordinate map, which corrects for op- 
tical distortions in the pixel image. The pixel map is 
based on a known, stable solution of 4th order polyno- 
mial in combina, t ion w ith a supplementary look-up table 
()Andersonll2002l 120061 ). to which a linear correction (ro- 
tation, translation, shear and unifor m scale) is applied to 
fit to the USNO-B catalog (Monet et aLll2003D for align- 
ment and absolute astrometry. Bad pixels, such as those 
affected by cosmic rays or saturation, are not used in 



the fitting procedure. The bad pixel masks for each ex- 
posure were generated u sing the multidrizzle package 
i|Koekemoer et aLll2002D . 

4.4. Galaxy Selection for Weak Lensing 

Once the object catalog has been generated, we select 
the galaxies for WL analysis by imposing the following 
cuts: 

• Size. The stellar contamination is removed by 
choosing objects which have characteristic size a 
which are > 1.5 times that of the PSF. Our 
cut is conservative to ensure PSF deconvolu- 
tion to be in the range wh ere it performs well 
(|Nakaiima fc Bernsteinil2007l ). 

• Significan ce (S/N). From our performan ce analysis 
of GLFit (|Nakaiima fc BernsteinI 120071 ). we know 
that the deconvolution starts to produce biases be- 
low S/N of 20 to 40, depending on the galaxy size 
with respect to the PSF. Here we choose the cut to 
be S/N > 25. 

• Magnitude. Since we only want the background 
galaxies to the cluster for the lensing analysis, 
foreground cluster member contamination must be 
minimal. Lacking redshift information, and un- 
able to utilize color information for this purpose 
for reasons listed below, we cut the brightest ob- 
jects (mpeoew < 24) as foreground. By plotting 
the galaxy number density as a function of the ra- 
dius from the central brightest galaxy of the cluster 
(Figured]), we estimate that there is at most 10% 
cluster member contamination for r < 75" from 
the cluster center for the tof606W > 24 objects. 

• Flags. Any object whose GLFit fiags indicate that 
the shape has not been measured (an unsuccessful 
fit to the basis ellipse) is rejected from the catalog. 

After these cuts, 1866 galaxies remain for our WL anal- 
ysis, or galaxy number density of 50 arcmin"^ (Fig- SI)- 
We have not included a color cut (for a rough exclusion 
of cluster members) for the following reason: Although 
several of the red-sequence cluster members of known 
redshifts have the predicted F606W — F814W color of 1.0 
(Bruzual & Chariot 1993) in our color-magnitude plot, 
no peak was found in the galaxy count at this color. 
This is because (1) majority of the cluster members are 
blue, and (2) the expected member galaxy count in this 
halo is small. The first point is verified from > 50% 
of cluster members of known redshifts which are bluer 
than red-sequence, with F606W — F814W colors in the 
range 0.3-0.8. Man y of these galaxies were found t o have 
spiral morphology i|Angonin-Willaime et aLlll994l ). The 
Q0957 cluster mass (and hence the number of its lumi- 
nous me mbers) is expected to b e low based on the cluster 
richness () Johnston et aLir2007l ). The richness N20V1 is de- 
fined by the number of member galaxies which are con- 
sistent with the red-sequence color, is within a certain 
radius of the brightest cluster galaxy, and have luminos- 
ity above 0.4L* (see Koester et al. 2007, for details). Al- 
though their definition for iV2oo is given only to z = 0.3, 
we extend th eir red-sequence color c ut to z = 0.35 based 
on Table 1 of lEisenstein et all ()2001[ ) , and determine the 
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Fig. 3. — (a) The variance of each of the principal components from each of the CCD chips. The dotted hne indicates the last major 
gap in the principal component variance. There are seven principal components to the left of this line, (b) From top left to bottom right: 
the first ten principal components of PSF in chip 1. 

objects that are smaller, have low flux, require larger 
a 60 ^ T ^^^F606W ^ number of transfers to the readout register (i.e., objects 

located close to the chip gap in the ACS WFC), and 
exposures that have been taken at a later date. 

We correct for CTE effects using a parametric model 
given the flux, size, location on CCD and observation 
date to estimate the spurious elongation, and correct 
for it by removing this shear from each object in the 
shape catalog. The model was derived from galaxies in 
the HST/CO S MOS Survey similarly to the one used by 
iRhodes et al\ (|2007l ). but additionall y takes sky back- 
ground variations into account (see iSchrabback et al\ 
50 100 150 200 lin prep.l . for further details). 

r (arcsec) The mean correction averaged over all galaxies in our 

sample is (7+®) = -0.008 ±0.010, with the worst case 
correction for small, dim objects near the chip gap being 



Fig. 4. — Weighted galaxy number density as a function of ra- 
dial distance from the central galaxy of the cluster. The weighted 
number density for use in shear estimating (BJ02) were essentially 
identical to the straight-forward number density. The error bars 
are based on Poisson statistics. The dimmer mpgoew > 24 ob- 
jects are used for weak lensing analysis (see text), where both set 
of objects have the size, significance and flag cuts applied. The 
dotted lines are the average weighted galaxy density for the re- 
gion 75" < r < 186", where the density averages to 11.1 and 
49.2 arcmin"^ for the bright and dim galaxies, respectively. The 
brighter objects (mpgoew < 24) have an excess of ~ (30 ± 15)% 
in galaxy density for r < 75" from the cluster brightest galaxy 
Gl; the fainter objects (used for WL) appear to have an excess of 
~ (10 ± 10)%. 

equivalent richness to b e iV2no = 2 f rom their SDSS mag- 
nitudes ( Adelman-Mc Carthv aLllMog). This is lower 
than the lowest richness bin available in iJohnston et al\ 
(|2007f ). an indication that the Q0957 cluster is more of a 
group than a cluster, and hence that the member count 
is low. 

4.5. CTE Correction 

The galaxy shape is affected by the degradation of the 
charge transfer efhciency (CTE) in the ACS WFC CCD. 
When a charge is read out from a CCD pixel, some of the 
charge is trapped and released at a later time, causing 
the galaxy image to elongate along the charge read-out 
direction, adding an excess shear 'y'^'^ < along the pixel 
axis, where 7_|_ of > or < indicates elongation in the 
horizontal or vertical direction, respectively (see H4.6.2I 
for definition of the orthogonal shears 7+ and 7x ) . The 
effect of charge-trailing on the object shape is worse for 



up to 7!}, = —0.04. Given the 2x2 mosaic configura- 
tion, we expect the CTE effect on the measured shear 
multipole moments to vanish to first order for the even 
multipoles. Overall, we find the correction to the mass 
distribution to be negligible, of the order < 3% (well 
within the error bar, see ij5]) for the mass sheet, and 
within < 10% of la error bar for the multipole moments. 

4.6. Mass Sheet and Multipole Moments 

The two-dimensional multipole mass distribution 
within an aperture can be obtained in a non-parametric 
fashion from the WL shear information surrounding the 
aperture. The overall mass sheet (i.e., the combined aver- 
age mass sheet from the SL model and the unconstrained 
cluster term Kc) corresponds to the monopole. Here 
we su mmarize the results from iBernstein &: Nakaiimal 
(|2008h . 

4.6.1. Mass Sheet 

For any mass distribution, the expected azimuthally 
avera ged tangential shear at radius r from the mass cen- 
ter is (|Miralda-Escude|[l99l 



Itir) 



E(< r) - S(r) _ AE(r) 



-'crit 



k{< r) ~ R{r) (9) 



where r is the angular distance from the cluster cen- 
ter, 7t(r) is the tangential shear averaged at r, S(< r) 
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is the mean surface density within an aperture of ra- 
dius r, I](r) is the azimuthally averaged surface den- 
sity at radius r, and Scrit is the critical surface density, 
and AI](r) = Ecrit7t is the value often quoted in WL 
literature. The ratio of the surface density to Scrit is 
the convergence n. We can directly estimate the mass 
sheet within an apert ure using the aperture mass method 
(jFahlman et a/.|[l994l ): the average convergence within a 
circular aperture of radius R is 



R{< R) = 



1 



(frK{r) = 



1 



r<R 



(fr 



7t(i-) 



(10) 



r>R 



where the tangential sh ear is integrated over R < r < oo 
(jKaiser fc Sauireslll993f ). The average convergence gives 
the average surface density if the critical surface density 
Scrit is known. We discuss Ecrit estimation in §4.71 

The practical estimator for Eq. (jlOp is a summation 
over galaxy shapes 



k(< R) 



1 



It.: 



mrTZ 



(11) 



where n is the surface number density of source galaxies 
and ^t,i is the tangential component of the ith galaxy 
shape. Here we have assumed that the locally averaged 
galaxy shape (tangential component) is an estimator of 
the shear, ~ {ltA)/Tl, where TZ is the responsivity, 
the multiplicative correction factor for the estimation of 
shear from galaxy shapes (BJ02). 

In practice, the summation over all galaxies &t r > R 
is truncated at a maximum radius i?max- Equation (jlip 
then becomes 



k{< R) - k{< i?max) 



1 



mrTZ 



E 



(12) 



ie(i?Xr-<i?,,„„^) 



The truncation at i?max is a source of uncertainty; that is, 
only the difference in the mass sheet at different aperture 
radii can now be determined (this is the WL mass sheet 
degeneracy). We then assume an appropriate model to 
derive the mass distribution within -Rmaxj and hence k{< 
-Rmax)- We feel safe in doing so, since this correction is 
reasonably smaller than k(< R). 

If the WL measurement yields an average convergence 
of K within r = R (normalized to the quasar redshift) 
and the SL modeling (assuming Kc — in Eq. ([S])) yields 
an average convergence of ksl, then the relation of the 
degenerate mass sheet k^. to the measured mass sheet k 
is K = Kc + (1 - Kc)k.sl, or 



1 



1 



1 - KSL 



(13) 



4.6.2. Multipole Moments 

The method of aperture mass ("Fahlman etal\ [19941 : 
ISchneideij Il996t ISclineider fc Bartclmann 199^ which 
integrates tangential shear to obtain the mass sheet 
within an aperture, can be generalized to obtain the "in- 
terior" and "ext erior" multipoles defined relat ive to a cir- 
cle of radius R (jBernstein fc Nakaiimall2008[ ). The "in- 
terior monopole" corresponds to the mass sheet, and is 
constrained from the shear information exterior to r = _R. 
The other multipoles are constrained in a similar manner, 
where the shear information exterior to r = i? constrains 



the mass multipole moments interior to r = i?, and vice 
versa. 

In order to utilize the shear information to extract 
mass multipoles, first we define the complex shear rel- 
ative to the tangent to t he circle (see Eq. (11) of 
iBernstein fc Nakajimall2008[ ) 



r(r) = 7t(r) -I- i7s(r) = -7(i")e" 



(14) 



where the coordinate origin of r = (r, 0) is at the cen- 
ter of the circle, and 74 and 7^ are the (real) tangential 
and "skew" ^ shear components with respect to the polar 
coordinates, respectively. 7(r) = 7+(r) -I- (r) is the 
complex shear defined in the (a;, y) coordinates, where 7+ 
is the shear along the x- or y-axis, and 7x is the shear 
along the diagonal halfway between the axes. 

The interior and exterior mass multipole moments rel- 
ative to a circle of radius R are defined as 



Jr<R 



K(r) 



dVr-"e*™'^K(r) 



(15) 
(16) 



r>R 



where these definitions are normalized to agree with 
Eqns (B5) of ISchneider fc Ba rtelmannI (|1997| ). but with 
an alteration in the phase convention. If the cluster po- 
tential at r < i? is described as Eq. ([8]), then, assuming 
no contribution from the galaxy potential ipg , the interior 
and exterior multipoles Q^™) are related to the cluster 
constants as 



qI^\r)^7tR'k, 
QW(i?) = Ji?V:(l-«:,) 



)(2) 
/ out 

1(3) 



(R) - -7r7, (1 - K,) 



-Sc (1 - Kc) 



(17) 
(18) 

(19) 
(20) 



independent of the cluster mass distribution at r > R, 
where k and a are the monopole and dipole mass and 
7 and S are the constant and m = 3 shear terms within 
r = R, respectively, and a* indicates complex conjuga- 
tion. The terms Q^^ and qI^^ vanish (i.e., there are 
no mass distribution within R of these multipoles), since 

K=iVV = OifV^oc r™e±""*, and the Q^^^t term pro- 
duces no shear internal to r < R. These terms have no 
effect upon the lens model or the time delay, and there- 
fore can be ignored. The m = —2 and m = —3 shear 
patterns within R (corresponding to 7c and Sc terms in 
Eq. ^) are generated by a quadrupole and sextupole 
m ass distribution, respectively , external to R (see Fig. 1 
of lBernstein fc Nakaiimall2008[ ) , hence these terms corre- 
spond to the external mass distributions Ql^t ■ If V'g 7^ 0; 



then the appropriate multipole terms Qg^^ or Qg 



for 



the galaxy potential enter each of the multipole terms, 
and the WL measurement constrains the sum of the SL 
galaxy model and cluster terms. 

^ The conventional name for this term is "radial" shear (7r). 
However, a shear which elongates in the radial direction is simply 
a negative tangential shear, and is not orthogonal to 74. The shears 
orthogonal to the tangential/radial directions are aligned ±45 de- 
grees from tangential/radial; hence we flout past conventions and 
rename this the "skew" shear, 73. 
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The multipoles of the convergence k are related to the 
integrals over the complex shear F, Eq. (fT4|) . as 



(frT{r)r 



'Qirii^ (21) 



r>R 



I (frT{v) 

Jr<R 



(m > 0) 
= i?2("-i)Q(™)(i?) (22) 



(m > 1) 

where the integral can be converted to a summation over 
source galaxy shapes as an estimator to the integral. The 
practical estimators avoids summation over i? — > oo or in 
the r < R region (where the weak lensing shear estimator 
breaks down) by integrating over an annular region Ri < 
r < R2: 



- E 



nil 
1 

1 



i?i<rj<fl2 



^r'ol"'(^i)-^2-'Qi7(i?2) (23) 



-Ri<r,<fl2 



Rl'Qln\Ri) - R2'Qin\R2) (24) 



E 

fli<r,<i?5 



^ E ^.(7M+*7.,,)e+^'^' = 



(25) 



Ri<rj<R2 



i?ffi(^2)-i?tQi'l(i?i) (26) 
where the summation is over the jth galaxies at radial 
distance rj and azimuth angles (jjj- In Eq- I123p . the 
summation over 74 is referred to as the E-mode aper- 
ture mass, which corresponds to the physical quantity 
which is the surface mass density, while a summation 
over 7s is the B-mode aperture mass which is expected to 
vanish, and hence provides some measure of systematics 
present in the monopole (similar measures are not avail- 
ab le for the higher-order mult ipole terms, as discussed 
in lBernstein fc Nakaiimall2008f ). The multipoles on the 
right hand sides are that of the combination of the clus- 
ter and galaxy potential; hence the summation constrains 
the combination of the cluster and galaxy multipoles. 

4.7. Redshift Estimation 

Our mass estimators from tangential shear yields esti- 
mates of the convergence, but not the mass sheet den- 
sity S itself. It is important to know the critical sur- 
face mass density Ecrit, which normalizes the convergence 
K = E/Ecrit and is redshift dependent, since the conver- 
gence obtained from WL (at a given source redshift dis- 
tribution) must be converted to an appropriate value for 
SL modeling, where the source is at Zgrc — -Zquasar — 1-41. 

In order to obtain the critical surface density Ecrit for 
WL, the source galaxy redshift distribution must be esti- 
mated. We adopt the magnitude dependent parameteri- 
zation (,Bau gh & Efstathiou 1993) 



^(z, m) oc exp[ 



zo(to) 



j(™) 



)3/2 



(27) 



200 



150 



8 100 



50 




22 



24 26 
magnitude 



28 



Fig. 5.— Distribution of F606W (solid line) and F814W (dotted 
line) AB magnitudes of galaxies used in the WL analysis. The ob- 
jects were detected in the F606W/F814W combined image, and the 
magnitudes in each were determined using the SExtractor double- 
image mode (Bertin & Arnouts 1996). The majority (92%) of our 
objects have F814W magnitude less than 26. 

where ^(2, w) is the magnitude dependent redshift dis- 
tribution, and Zmcd{fn) is the median redshift as a func- 
tion of magnitude m. From the COSMOS^ ACS data, 
iLeauthaud et al\ ()2007l ) obtain the median redshift as a 
function of F814W magnitude as 

2mcd (0.18 ± 0.01) X mF8i4W - (3.3 ± 0.2) (28) 

This relation is valid over the magnitude range 20 < 
mF8i4w < 24; however, their data suggest that the re- 
lation can be extended out to tof814W < 26, based on 
the UDF^ data (Coe et a/. 2006) , which show agreement 
in the 20 < mpsMw < 24 region with COSMOS data. 
Hence we use Equation ([28|) to estimate our median red- 
shift, since our objects are within this magnitude range 
(Figure . 

The final redshift distribution was estimated in the fol- 
lowing manner: 

1. Divide the galaxies into Atof814W — 0.25 bins, 

2. determine the redshift distribution for each bin us- 
ing Eq. ^ and Zmod from Eq. and 

3. sum over each distribution with each bin properly 
weighted. 

The weight of each bin is determined from the weigh t 
each galaxy shape gets in estimating the shear (|BJ02|) . 



^ Cosmic Evolution Survey (| http : //cosmos . astro . caltech . edu/J 

COSMOS is an ACS survey over 1.67 deg^, where a single orbit 
(~ 2000 seconds) exposure is tiled over this relatively wide field 
in F814W. It has 50% completion for sources 0.5" in diameter at 

F814W Iab = 26.0 (Scoville et al. 2003)^ 

^ Hubble Ultra Deep Field (http://www.stsci.6du/hst/udf/ ). 
The UDF data is a multi-color, deep image over a single ACS hcld- 
of-view (11.97 arcmin^) and has a lO-cr limiting magnitude (for a 
0.5" diameter aperture) of 28.4 in the F850LP filter, with 144 orbit 
exposures. The F850LP filter is narrower and less efficient than the 
F814W filter. 
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From the estimated source redshift distribution, we cal- 
culate the mean lensing strength, which is proportional 
to S~[^, and hence to 



/^\ 
\Dos/ 



Jdmjdz^{z,m)w{m)^ 



(29) 



J dm w{m) 

where ^{z, m) is normalized to unit integral, and w{m) 
is the weight per magnitude bin. We find (^%^^ = 
0.572, or the mean weak lensing critical density to be 

C2 1 



47rG DoL 



\Dos/ 



= 3800/1 Mq pc~^ 

(30) 

0.24, and where 



for a flat ACDM cosmology with 
Zlons = 0.355. 

Since the geometry dependence of the gravitational 
lens system is contained in Edit, its value differs for 
the same lens if the source object is at different redshift. 
We compare the critical density Ecrit obtained above to 
E^.it = 335O/iM0pc^^ for the lensed quasar source at 
z = 1.41, and find that the lensing signal, which is pro- 
portional to S~J^, is enhanced by 13% relative to the WL 
galaxy population. In other words, 



<-Q(r) = 



-'crit 



■^crit 



K(r) 



(31) 



where ^'^(r) is the convergence at quasar redshift, Scrit 
and S^.j^ are the critical density for sources at the WL 
galaxy population and quasar, respectively, and K(r) is 
the convergence with respect to the WL galaxy popula- 
tion. 

5. RESULTS 

5.1. Mass Sheet 

The average tangential shears at different radii are 
plotted in Figure [HI The data points, although noisy, 
are independent of each other, so it is appropriate to 
fit a model to these data. We assume a core-softened 
isothermal sphere (CIS) mass distribution 



K(r) 



So 



[l + (r/r,)2]-i/2 



(32) 



and use Eq. ([9]) to compute the corresponding tangen- 
tial shear. Fixing the core radius at Vc — 5", we find 
Eo/Scrit = 0.47 ±0.17, or So = (1800 ± 600)/i M© pc'^ 
(1 a error estimates), with reduced of 0.93. This 
corresponds to a cluster velocity dispersion of it„ — 
420 ± 70 km s"^ jh = 0.7). A fit to a NFW model 
(|Navarro. Frenk fc White! [l996l ) was attempted, but did 
not provide any useful constraint on the concentration. 

Figure [7] shows the aperture mass statistic as defined 
in Eq. (jl2p . with respect to a varying inner annulus. The 
points in this figure are correlated, since each point is an 
integral over galaxies in the annulus from i?max = 186" 
to the inner radii R. The points show the azimuthally 
averaged radial profile of the mass concentration around 
Q0957. Because of the WL mass sheet degeneracy, it is 
not the true radial profile, but a relative value with re- 
spect to a mass sheet averaged within a circular aperture 

of r < i?max- 



From the aperture mass, we find the average conver- 
gence overdensity to be 

Ak = K (< 30") - K (< 186") = 0.122 ± 0.048 

where the radii R = 30" and i?max = 186" are with 
respect to Gl. The average convergence within i?max = 
186" can be estimated from the CIS model, whose fit 
value yields k (< 186") = 0.024 ± 0.012. The error here 
is conservative (i.e., not based on the error in the fit) to 
account for various possible mass distributions. The two 
results can then be combined to yield 



R (< 30'' 



0.146 ±0.049 



From the redshift distribution of the WL source galax- 
ies, this convergence corresponds to a mean mass sheet 
density of 

E (< 30") = (550 ± 190)/i A/qpc~2 

For sources at the quasar redshift z — 1.41, this mass 
sheet then corresponds to a convergence of 

k{< 30") = 0.166 ±0.056 

for the strong lensing analysis (see Eq (|3T|) ) . As explicitly 
stated above, this "mean mass sheet" includes the Gl 
mass averaged over the r < 30" disk; hence to obtain the 
cluster mass sheet Kc, the galaxy mass (modeled from 
SL) must be properly remov ed from the above quantity 
(Eq. (p ll : lFadelv ^Ftll[2009h . 

5.1.1. Comparison with Fischer et al. 

Our result is consistent with the WL results based on 
Canada France Haw aii Telescope (CFHT) images from 
iFischer et al\ (|1997l ). They obtain k{< 30") = 0.16±0.05 
(from their Figure 6) compared to our R{< 30") — 
0.146 ± 0.049, where we have compared the shears as 
obtained from the galaxy shapes, without normalizing 
to the quasar redshift. While the two results and error 
magnitudes are nearly identical, they are obtained in a 
completely different fashion. We obtain the shear cal- 
ibration factor (our responsivity TZ) and propagate the 
shape noise and shape measurement errors from individ- 
ual galaxies into the estimated she ar in a deterministic 
manner based on the formalisms of lBJ02l while Fischer 
et al. calibrate the shear and obtain the error based on 
Monte-Carlo simulations. 

The similarity in the error magnitude could possibly be 
attributed to the similar number of galaxies we use (1651 
and 1866 galaxies for Fischer et al. and this study, respec- 
tively), since shear estimate error is typically dominated 
by the statistical shape noise. Both studies use source 
galaxies in a similar area and magnitude range. How- 
ever, the resolution of the images used in the two studies 
are vastly different: the CFHT images have 0.6" FWHM 
seeing with 0.207"/pixel, compared to our 0.13" FWHM 
with with 0.05"/pixel. Since the images with the smaller 
seeing and pixel scale should reveal the galaxy shapes 
better, the similarity in the results indicates that the er- 
ror is indeed dominated by the statistical error. There- 
fore, in order to improve upon the mass sheet precision, 
we would need to increase the number count of the source 
galaxies, which would be possible if source galaxies of 
lower S/N in the HST images can be utilized. Currently, 
the accuracy in our galaxy shape measurement and shear 
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Fig. 6. — Left: Plot of azimuthally averaged tangential shear, {"ft), centered on Gl, as a function of radius r (solid squares). The dashed 
line is the best fit isothermal sphere with 5" core, with the short-dashed lines indicating the one-sigma deviation of the fit. The B-mode 
("skew") shear (7^) (open squares) is consistent with zero, as expected from shears generated from a centrally distributed mass. Right: 
Radial bins used in obtaining the mean tangential shear. Each dot indicates a galaxy in the 6' X 6' ACS mosaic which are suitable for use 
in the weak lensing analysis, while the annuli delineated by the dotted line indicate the radial bins. The radius of the outermost circle is 
Rmux = 186". The coordinates are centered around the lensing galaxy Gl. 
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Fig. 7. — Left: The aperture mass statistic, which plots the differential convergence calculated from integrating over the tangential shear 
in Eq. II12I I. The solid squares are the tangential shear integrals, the dashed line is the fit to an core-softened isothermal sphere with 5" 
core with the dotted lines indicating one-sigma deviations of the fit, and the open squares are the imaginary components (B-mode aperture 
masses, or azimuthally averaged skew shears) whose signal should be consistent with zero. The points in this figure are correlated, since 
each point is an integral over the annulus from -Rmax to the inner radius R. Right: The same plot in log scale. 

estimation method is limited to high S/N objects (> 25). 
Hence a shape measurement method which can be shown 
to be accurate at low S/N would improve precision in the 
mass sheet measurement, given the same HST images. 

Although the t wo observed convergence are consistent 
with each other, iBernstein fc Fisched (|1999f ) report a 
converge nce of R{< 30 " ) = 0.26 ± 0.08 based on the 
results of iFischer et al\ (|1997f ). when normahzed to the 
quasar redshift. This discrepancy comes from the differ- 
ence in the assumed Scrit values: although the redshift of 
the quasar has not changed, both the assumed WL source 
galaxy distribution and the cosmology are different be- 
tween the two analyses. While we use a magnitude-based 
empirical redshift distribution (i i4.7p with a flat ACDM 



with rim = 0.24, 'F ischer efaJ] ^99T) estimate the red- 
shift distribution based on Monte-Carlo simulation of an 
assumed redshift evolution of galaxy size, along with an 
open CDM cosmology with = 0.1. 

5.1.2. Comparison with Other Cluster Mass Estimates 

IChartas et al\ ()2002[ ) use 2' x 2' Chandra observation 
to determine the mass distribution of the cluster. From 
the spatial distribution of the X-ray luminosity and the 
X-ray temperature, they find the cluster mass to be 

5 + 20 ^ 10"'^^'*'^- -""t^i" f T-Q^^mo /^f n A/Tnr. t^. o'A 



Mq within a radius of 0.5 h^^^ Mpc (~ 2') 



of the cluster center. Our corresponding WL mass es- 
timate within the same radius is (3.3 ± 1.1) x lO^^Mg 
(for h = 0.7), an agreement within ~ la. We note that 
since the mass here is an integral over a radial density 



Weak Lensing Constraints on Q0957+561 



11 



profile, the cluster mass estimate agreement between WL 
and X-ray, which assumes a different radial profile (CIS 
and /3- model, respectively), will differ depending on the 
outermost radius chosen. 

On the other hand iGarrett et al\ (jl992') and 
lAngonin-Willaime et al\ (|1994f l obtain a cluster member 
velocity dispersion of = 715 ± 130 km s~^ from red- 
shifts for 21 probable cluster members, compared to our 
ay equivalent of 420±70km s^^ [h = 0.7). The dispersion 
data correspond to cluster mass of (17.8 ±6.5) x IQ'^^AIq 
{h = 0.7) within a Q.bhj^ Mpc radius. The 2a discrep- 
ancy between the WL and velocity-dispersion mass es- 
timates suggests that the velocity dispersion is possibly 
highly anisotropic, with an enhanced peculiar velocity 
along the line of sight. lAngonin-Willaime et al\ (|1994f ) 
suggest that cluster member selection could also be the 
cause for the large dispersion, where they find removing 
a single galaxy at the edge of the velocity distribution 
reduces the velocity dispersion to 660 km/s. 

5.2. Multipole Moments 

The rest of the (differential) multipole moments as ob- 
tained from Eqns. ([24l - l26|) are listed in Table [l] where 
the differential is between the values at i? = 30" and 
^max ~ 186". Figure [8] plots the multipoles with respect 
to the value at i?max for various inner radii R, which 
highlights the problem with constraining the Qo™^ terms. 
Due to the powers of R involved in the summation, the 
shear signal, as well as the shape noise, are weighed by 
2 for q|™\ and by i?2™-2 q(™)^ 
the outer region (dominated by shape noise) are weighted 
less, and the signal at smaller radii can be seen (Fig. [8] 



left). At Q, 



> 1, (Fig. [Hcenter and right), whatever 



signal existing at the inner radii is overwhelmed by the 
shape noise in the outer radii, and hence the differential 
signal as R decreases to 30" is merely noise which domi- 
nates at larger radii. This effect is more severe for Q^ut 
(Fig. [8] right), where higher powers of R are involved, 
that the shear from R < 100" hardly contributes to the 
integrated shear. 

If we are to constrain the multipoles at i? = 30", it is 
also necessary to have a handle on their values at -Rmax- 
This value is estimated to be up to 20% of the values 



at R — 30" for the Qout Qmlt terms, and around 
1% for the qIjJ"* term, for a CIS cluster whose center is 
displaced from the Gl center by 5" to 30". Since our 
multipole signals are already significantly smaller than 
the measurement errors, we deduce that the correction 
from the i?max term is significantly smaller than the er- 
rors if it arises from a cluster that resembles a CIS profile 
beyond i?,„ax- 

Ignoring the i?niax terms from the cluster and assuming 
the lensing potential as stated in Eq. ([8]), the cluster 
expansion coefficients (normalized to the quasar redshift) 
are constrained to 



(3) 



(1 - K,) a, -I- ^(1 - n,) i?r'QlnL(^i) - R2'Qlnlm 



(-0.0018 ± 0.0042) + i(-0.0031 ± 0.0052) arcsec 



(1-Kc)7c+-(1 
TT 



Ri 



{R2) 



= (0.00 ± 0.34) + i{+0.09 ± 0.33) 
2 

(1 - Kc)Sc + -(1 - Kc) 
TT 



Qiut,g(^l) 



'R2 

■.Ri . 



'3iut,g(^2) 



= (+0.115 ± 0.098) + i(-0.047 ± 0.097) arcsec'^ 
As explicitly stated above, the galaxy model contribu- 
tion Qg™'' to each moment must be subtracted from the 
results to obtain the cluster moments 7c, ctc and 6c', the 
measurements constrain only the sum of the galaxy and 
cluster contributions. 

As seen in Figure [51 the error in the 7c and Sc terms 
do not converge to any sufficient degree. However, we 
attempt to constrain 7c by the following means: Since 
this term is simply the constant shear across r < 30", 
we measure the average of the (WL) shear within this 
region. However, since the central portion of this region 
is strongly lensed, we average the shear signal over an 
annular region of 20" < r < 40" to estimate the con- 
stant shear within r < 30". We find the average shear, 
normalized to the quasar redshift, to be 



(1 - Kc)7c+-(1 - Kc) 
TT 



?iut,g(^l) 



R2 

Ri 



Q^olt.giR^) 



= (-0.009 ± 0.045) + i(+0.092 ± 0.045). 

This allows us to shrink the error bar considerably, and 
by doing so we have assumed that 7c is generated exclu- 
sively by mass exterior to 40". The directionality of the 
constant shear term is consistent with an exterior mass 
quadrupole distribution with positive weight along the 
Northeast/Southwest direction with respect to Gl. 

6. CONCLUSION 

The Q0957-I-561 gravitational lens system allows for a 
determination of the Hubble constant Hq, based on its 
firm time delay between the double images of the quasar; 
the remaining uncertainty originates from our knowledge 
of the lens mass distribution. The lens consists of the 
galaxy Gl and the 0957+561 cluster to which the galaxy 
belongs. WL offers complementary constraints to the 
lens mass provided by the SL modeling analysis, by pro- 
viding the mass sheet and multipole moments of the un- 
derlying cluster. We have developed and utilized the for- 
malism for an exact solution for aperture mass multipoles 
from WL shear data in the thin-lens approximation. 

The mean convergence in the SL region r < 30" is 
estimated to be k = 0.166 ± 0.056 (1 a) normahzed to 
the quasar redshift, based on shear data from the annu- 
lar region 30" < r < 186", where the radii are centered 
on the lens galaxy Gl. Althou gh our shear measure- 
ments are consistent with iFischer et al\ ()1997t ). the new 
quasar-normalized R is more reliable since they are based 
on better data, a more accurate galaxy redshift distribu- 
tion, and a standard cosmology. Our results give a 7% 
precision in the mass sheet degeneracy term (1 — k). 

The uncertainty in the external multipole terms 7c 
and Sc are too large to provide useful constraints to the 
lens potential ijj within r < 30". However, the constant 
shear term (1 — R,)jc within r < 30" can be estimated 
by straightforward averaging of the shear in the WL re- 
gion at 20" < r < 40". The internal dipole {1 - R)\ac\ 
(whose galaxy contribution is expected to be negligible) 
has a value of 0.006 ± 0.006, compared to the Monte 
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Ri'^QUHRi) - R2^Q'-^\R2) = [(-1.2 ± 2.9) + i(+2.1 ± 3.6)] x IQ-^ arcsec"! 
RlQo-ltiRi) - RWoliRl) = [(+0-1 ± 8.5) + i(-2.3 ± 8.2)] X 10^ arcsec^ (*) 
^^Qiut(-f?2) - RjQ'-oliRl) = [(-1.3 ± 1.1) + i(-0.5 ± 1.1)] X IQS arcsec3 



TABLE 1 

Table of multipole values, where R2 = iJmax = 186", and Ri = 30". These values correspond to the innermost points 
PLOTTED IN Figure [S] and hence are normalized to the WL source redshifts. (*) There is an alternative means of obtaining 

the 7c [i^^Qo^t] term, via STRAIGHTFORWARD AVERAGING OF THE WEAK SHEAR VALUES WITHIN THE ANNULUS AROUND R = Rl (SEE 

text). 
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Left Differential R~'^Q^^\r) with respect to the value at Rn 



100 

R (arcsec) 



200 



Center. Differential R'^q'^^^(R). Right: Differential 

R Qouti^)- In all three fi gures, the solid squares indicate the real components, while the open squares indicate the imaginary components. 
The complex numbers encode the multipole orientation, where the axes are aligned with West ("x") and North ("y"). The internal 
mass dipole term has a directionality t hat points approximately to the Northeast with respect to Gl, consistent with previous studies 
lIFischer et aL|[T997l: FChartas et aLI|2002| ). 



Carlo parame t ric fit values ~ 0.011 ± 0.007 obtained by 
iKeeton et al\ ()2000D . The implications for Hq with the 
full SL lens modeling are discussed in an accompanying 
paper (|Fadelv et aLll2009t ). 
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